<!DOCTYPE html SYSTEM "about:legacy-compat">
<html manifest="pamflet.manifest">
      <head>
        <title>MongoDB+Hadoop Connector — Hadoop Streaming Support</title>
        <link type="text/css" media="screen, projection" rel="stylesheet" href="css/blueprint/screen.css"></link>
        <link type="text/css" media="screen and (min-device-width: 800px), projection" rel="stylesheet" href="css/blueprint/grid.css"></link>
        <link type="text/css" media="print" rel="stylesheet" href="css/blueprint/print.css"></link> 
        <!--[if lt IE 8]>
          <link rel="stylesheet" href="css/blueprint/ie.css" type="text/css" media="screen, projection"/>
        <![endif]-->
        <link type="text/css" media="screen, projection" rel="stylesheet" href="css/pamflet.css"></link>
        <link type="text/css" media="print" rel="stylesheet" href="css/pamflet-print.css"></link>
        <link type="text/css" media="screen and (min-device-width: 800px), projection" rel="stylesheet" href="css/pamflet-grid.css"></link>
        
        <script src="js/jquery-1.6.2.min.js"></script>
        <script src="js/jquery.collapse.js"></script>
        <script src="js/pamflet.js"></script>
        
        <meta charset="utf-8"></meta>
        <meta name="viewport" content="width=device-width, initial-scale=1"></meta>
      </head>
      <body>
        <a class="page prev nav" href="Configuration+%26+Behavior.html">
            <span class="space">&nbsp;</span>
            <span class="flip">❧</span>
          </a><a class="page next nav" href="Building+Hadoop+Streaming+Support.html">
            <span class="space">&nbsp;</span>
            <span>❧</span>
          </a>
        <div class="container">
          <div class="span-16 prepend-1 append-1">
            <div class="top nav span-16 title">
              <span>MongoDB+Hadoop Connector</span> — Hadoop Streaming Support
            </div>
          </div>
          <div class="span-16 prepend-1 append-1 contents">
            <h1 id="Hadoop+Streaming+Support">Hadoop Streaming Support</h1><p><a title="Hadoop Streaming" href="http://hadoop.apache.org/common/docs/current/streaming.html">Hadoop Streaming</a> is an add-on module to Hadoop which provides a facility for composing Hadoop jobs in non-JVM languages. With streaming, any programming language (or tool) which supports access to <strong>STDIN</strong> and <strong>STDOUT</strong> can be leveraged for data processing. To that end, an adapter for using MongoDB with Hadoop Streaming is provided in this package (as long as the build of Hadoop you are using supports the Streaming features we require).
</p><p>As working with MongoDB requires the ability to manipulate <a  href="http://bsonspec.org">BSON</a>, the options for languages to use with <em>mongo-hadoop-streaming</em> are somewhat more limited.  At this time, we provide a support module to work with Python, and supporting additional languages (such as Ruby) is planned for a future release.
</p><h4 id="Availability+of+MongoDB+Streaming+Support">Availability of MongoDB Streaming Support</h4><p>Due to the manner Hadoop Streaming is used, <em>mongo-hadoop-streaming</em> requires the construction of a “fat” (sometimes known as a “shaded” or “assembly”) jar, which contains all of its dependencies including the Java Driver.  This limits our ability to publish Streaming support to Maven.  Instead, if you’d like to leverage <em>mongo-hadoop-streaming</em> you should build it yourself, or download a preassembled jar from our <a  href="https://github.com/mongodb/mongo-hadoop">Github Page</a>.
</p><div class="tocwrapper show">
      <a class="tochead nav" style="display: none" href="#toc">❦</a>
      <a name="toc"></a>
      <h4 class="toctitle">Contents</h4>
      <div class="tocbody">
      <div><a href="MongoDB%2BHadoop+Connector.html">MongoDB+Hadoop Connector</a></div><ol class="toc"> <li><div><a href="Frequently+Asked+Questions.html">Frequently Asked Questions</a></div></li><li><div><a href="Getting+Started.html">Getting Started</a></div><ol class="toc"> <li><div><a href="Building+the+Adapter.html">Building the Adapter</a></div></li><li><div><a href="Configuration+%26+Behavior.html">Configuration &amp; Behavior</a></div></li> </ol></li><li><div class="current">Hadoop Streaming Support</div><ol class="toc"> <li><div><a href="Building+Hadoop+Streaming+Support.html">Building Hadoop Streaming Support</a></div></li> </ol></li><li class="generated"><div><a href="Contents+in+Depth.html">Contents in Depth</a></div></li><li class="generated"><div><a href="Combined+Pages.html">Combined Pages</a></div></li> </ol></div></div>
          </div>
        </div>
        <a class="fork nav" href="http://github.com/mongodb/mongo-hadoop"><img alt="Fork me on GitHub" src="img/fork.png"></img></a>
      </body>
    </html>